3574 results found.
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
None Production Status:
Use:
-
Paper title:Do you Feel Certain about your Annotation? A Web-based Semantic Frame Annotation Tool Considering Annotators’ Concerns and Behaviors
-
Paper track:Evaluation/poster presentation with demo
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Regina Stodden | SemEval 2019 Task 2 Corpus | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Bilingual
Languages:
Burmese English
Availability:
From Owner
License:
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Size:
84057 entries Production Status:
Newly created-finished
Use:
Lexicon Creation/Annotation
-
Paper title:A Myanmar (Burmese)-English Named Entity Transliteration Dictionary
-
Paper track:Terminology/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chenchen Ding | Myanmar-English Transliteration Data Set | /N |
Documentation:
English READMI zipped with the data
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
English Finnish Turkish
Availability:
From Owner
License:
Size:
617k -- 2.9M types words Production Status:
Existing-used
Use:
Morphological Analysis
-
Paper title:Morfessor EM+Prune: Improved Subword Segmentation with Expectation Maximization and Pruning
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Stig-Arne Grönroos | Morpho Challenge 2010 dataset | /N |
Documentation:
English
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None Production Status:
Newly created-finished
Use:
Opinion Mining/Sentiment Analysis
-
Paper title:Multi-domain Tweet Corpora for Sentiment Analysis: Resource Creation and Evaluation
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mamta . | Multi-domain Tweet Sentiment Corpora | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
n/a
Size:
14,100 entries Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:I Feel Offended, Don’t Be Abusive! Implicit/Explicit Messages in Offensive and Abusive Language
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tommaso Caselli | OLID | /N |
Documentation:
https://arxiv.org/abs/1902.09666
Written
Corpus,
Language Type:
Bilingual
Languages:
English Spanish
Availability:
Freely Available
License:
CreativeCommons
Size:
19,600 entries Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:I Feel Offended, Don’t Be Abusive! Implicit/Explicit Messages in Offensive and Abusive Language
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tommaso Caselli | HatEval | /N |
Documentation:
https://www.aclweb.org/anthology/S19-2007/
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
3000 sentences Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:Extracting Adherence Information from Electronic Health Records
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jordan Sanders | EHR Adherence | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
70000 entries Production Status:
Obtained from "Everyday Sexism Project"
Use:
Information Extraction, Information Retrieval
-
Paper title:Semi-supervised Multi-task Learning for Multi-label Fine-grained Sexism Classification
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Harika Abburi | Unlabeled Accounts of Sexism | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
681,288 entries Production Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:Semi-supervised Multi-task Learning for Multi-label Fine-grained Sexism Classification
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Harika Abburi | Blog Authorship Dataset | /N |
Documentation:
Yes.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
13000 entries Production Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:Semi-supervised Multi-task Learning for Multi-label Fine-grained Sexism Classification
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Harika Abburi | Multi-label Sexism Account Categorization Dataset (EMNLP 2019) | /N |
Documentation:
The documentation is available publicly in English.




